185 research outputs found

    A Strategy analysis for genetic association studies with known inbreeding

    Get PDF
    Background: Association studies consist in identifying the genetic variants which are related to a specific disease through the use of statistical multiple hypothesis testing or segregation analysis in pedigrees. This type of studies has been very successful in the case of Mendelian monogenic disorders while it has been less successful in identifying genetic variants related to complex diseases where the insurgence depends on the interactions between different genes and the environment. The current technology allows to genotype more than a million of markers and this number has been rapidly increasing in the last years with the imputation based on templates sets and whole genome sequencing. This type of data introduces a great amount of noise in the statistical analysis and usually requires a great number of samples. Current methods seldom take into account gene-gene and gene-environment interactions which are fundamental especially in complex diseases. In this paper we propose to use a non-parametric additive model to detect the genetic variants related to diseases which accounts for interactions of unknown order. Although this is not new to the current literature, we show that in an isolated population, where the most related subjects share also most of their genetic code, the use of additive models may be improved if the available genealogical tree is taken into account. Specifically, we form a sample of cases and controls with the highest inbreeding by means of the Hungarian method, and estimate the set of genes/environmental variables, associated with the disease, by means of Random Forest. Results: We have evidence, from statistical theory, simulations and two applications, that we build a suitable procedure to eliminate stratification between cases and controls and that it also has enough precision in identifying genetic variants responsible for a disease. This procedure has been successfully used for the betathalassemia, which is a well known Mendelian disease, and also to the common asthma where we have identified candidate genes that underlie to the susceptibility of the asthma. Some of such candidate genes have been also found related to common asthma in the current literature. Conclusions: The data analysis approach, based on selecting the most related cases and controls along with the Random Forest model, is a powerful tool for detecting genetic variants associated to a disease in isolated populations. Moreover, this method provides also a prediction model that has accuracy in estimating the unknown disease status and that can be generally used to build kit tests for a wide class of Mendelian diseases

    Participation bias in the UK Biobank distorts genetic associations and downstream analyses

    Get PDF
    While volunteer-based studies such as the UK Biobank have become the cornerstone of genetic epidemiology, the participating individuals are rarely representative of their target population. To evaluate the impact of selective participation, here we derived UK Biobank participation probabilities on the basis of 14 variables harmonized across the UK Biobank and a representative sample. We then conducted weighted genome-wide association analyses on 19 traits. Comparing the output from weighted genome-wide association analyses (neffective = 94,643 to 102,215) with that from standard genome-wide association analyses (n = 263,464 to 283,749), we found that increasing representativeness led to changes in SNP effect sizes and identified novel SNP associations for 12 traits. While heritability estimates were less impacted by weighting (maximum change in h2, 5%), we found substantial discrepancies for genetic correlations (maximum change in rg, 0.31) and Mendelian randomization estimates (maximum change in βSTD, 0.15) for socio-behavioural traits. We urge the field to increase representativeness in biobank samples, especially when studying genetic correlates of behaviour, lifestyles and social outcomes

    Mendelian randomisation identifies priority groups for prophylactic EBV vaccination

    Get PDF
    BACKGROUND: Epstein Barr virus (EBV) infects ~ 95% of the population worldwide and is known to cause adverse health outcomes such as Hodgkin’s, non-Hodgkin’s lymphomas, and multiple sclerosis. There is substantial interest and investment in developing infection-preventing vaccines for EBV. To effectively deploy such vaccines, it is vital that we understand the risk factors for infection. Why particular individuals do not become infected is currently unknown. The current literature, describes complex, often conflicting webs of intersecting factors—sociodemographic, clinical, genetic, environmental-, rendering causality difficult to decipher. We aimed to use Mendelian randomization (MR) to overcome the issues posed by confounding and reverse causality to determine the causal risk factors for the acquisition of EBV. METHODS: We mapped the complex evidence from the literature prior to this study factors associated with EBV serostatus (as a proxy for infection) into a causal diagram to determine putative risk factors for our study. Using data from the UK Biobank of 8422 individuals genomically deemed to be of white British ancestry between the ages of 40 and 69 at recruitment between the years 2006 and 2010, we performed a genome wide association study (GWAS) of EBV serostatus, followed by a Two Sample MR to determine which putative risk factors were causal. RESULTS: Our GWAS identified two novel loci associated with EBV serostatus. In MR analyses, we confirmed shorter time in education, an increase in number of sexual partners, and a lower age of smoking commencement, to be causal risk factors for EBV serostatus. CONCLUSIONS: Given the current interest and likelihood of a future EBV vaccine, these factors can inform vaccine development and deployment strategies by completing the puzzle of causality. Knowing these risk factors allows identification of those most likely to acquire EBV, giving insight into what age to vaccinate and who to prioritise when a vaccine is introduced

    Microsatellites and SNPs linkage analysis in a Sardinian genetic isolate confirms several essential hypertension loci previously identified in different populations

    Get PDF
    Background. A multiplicity of study designs such as gene candidate analysis, genome wide search (GWS) and, recently, whole genome association studies have been employed for the identification of the genetic components of essential hypertension (EH). Several genome-wide linkage studies of EH and blood pressure-related phenotypes demonstrate that there is no single locus with a major effect while several genomic regions likely to contain EH-susceptibility loci were validated by multiple studies. Methods. We carried out the clinical assessment of the entire adult population in a Sardinian village (Talana) and we analyzed 16 selected families with 62 hypertensive subjects out of 267 individuals. We carried out a double GWS using a set of 902 uniformly spaced microsatellites and a high-density SNPs map on the same group of families. Results. Three loci were identified by both microsatellites and SNP scans and the obtained linkage results showed a remarkable degree of similarity. These loci were identified on chromosome 2q24, 11q23.1–25 and 13q14.11–21.33. Further support to these findings is their broad description present in literature associated to EH or related phenotypes. Bioinformatic investigation of these loci shows several potential EH candidate genes, several of whom already associated to blood pressure regulation pathways. Conclusion. Our search for major susceptibility EH genetic factors evidences that EH in the genetic isolate of Talana is due to the contribution of several genes contained in loci identified and replicated by earlier findings in different human populations

    Genome-wide meta-analysis associates HLA-DQA1/DRB1 and LPA and lifestyle factors with human longevity

    Get PDF
    Genomic analysis of longevity offers the potential to illuminate the biology of human aging. Here, using genome-wide association meta-analysis of 606,059 parents’ survival, we discover two regions associated with longevity (HLA- DQA1/DRB1 and LPA). We also validate previous suggestions that APOE, CHRNA3/5, CDKN2A/B, SH2B3 and FOXO3A influence longevity. Next we show that giving up smoking, educational attainment, openness to new experience and high-density lipoprotein (HDL) cholesterol levels are most positively genetically correlated with lifespan while susceptibility to coronary artery disease (CAD), cigarettes smoked per day, lung cancer, insulin resistance and body fat are most negatively correlated. We suggest that the effect of education on lifespan is principally mediated through smoking while the effect of obesity appears to act via CAD. Using instrumental variables, we suggest that an increase of one body mass index unit reduces lifespan by 7 months while 1 year of education adds 11 months to expected lifespan

    Identification of genetic loci simultaneously associated with multiple cardiometabolic traits

    Get PDF
    Background and aims: Cardiometabolic disorders (CMD) arise from a constellation of features such as increased adiposity, hyperlipidemia, hypertension and compromised glucose control. Many genetic loci have shown associations with individual CMD-related traits, but no investigations have focused on simultaneously identifying loci showing associations across all domains. We therefore sought to identify loci associated with risk across seven continuous CMD-related traits. Methods and results: We conducted separate genome-wide association studies (GWAS) for systolic and diastolic blood pressure (SBP/DBP), hemoglobin A1c (HbA1c), low- and high- density lipoprotein cholesterol (LDL-C/HDL-C), waist-to-hip-ratio (WHR), and triglycerides (TGs) in the UK Biobank (N = 356,574–456,823). Multiple loci reached genome-wide levels of significance (N = 145–333) for each trait, but only four loci (in/near VEGFA, GRB14-COBLL1, KLF14, and RGS19-OPRL1) were associated with risk across all seven traits (P < 5 × 10−8). We sought replication of these four loci in an independent set of seven trait-specific GWAS meta-analyses. GRB14-COBLL1 showed the most consistent replication, revealing nominally significant associations (P < 0.05) with all traits except DBP. Conclusions: Our analyses suggest that very few loci are associated in the same direction of risk with traits representing the full spectrum of CMD features. We identified four such loci, and an understanding of the pathways between these loci and CMD risk may eventually identify factors that can be used to identify pathologic disturbances that represent broadly beneficial therapeutic targets.National Institutes of Health12 month embargo; published: 07 January 2022This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]

    Genetic analyses identify widespread sex-differential participation bias

    Get PDF
    Genetic analyses identify widespread sex-differential participation bias in population-based studies and show how this bias can lead to incorrect inferences. These findings highlight new challenges for association studies as sample sizes continue to grow. Genetic association results are often interpreted with the assumption that study participation does not affect downstream analyses. Understanding the genetic basis of participation bias is challenging since it requires the genotypes of unseen individuals. Here we demonstrate that it is possible to estimate comparative biases by performing a genome-wide association study contrasting one subgroup versus another. For example, we showed that sex exhibits artifactual autosomal heritability in the presence of sex-differential participation bias. By performing a genome-wide association study of sex in approximately 3.3 million males and females, we identified over 158 autosomal loci spuriously associated with sex and highlighted complex traits underpinning differences in study participation between the sexes. For example, the body mass index-increasing allele at FTO was observed at higher frequency in males compared to females (odds ratio = 1.02, P = 4.4 x 10(-)(36)). Finally, we demonstrated how these biases can potentially lead to incorrect inferences in downstream analyses and propose a conceptual framework for addressing such biases. Our findings highlight a new challenge that genetic studies may face as sample sizes continue to grow.Peer reviewe

    Mendelian Randomization Identifies the Potential Causal Impact of Dietary Patterns on Circulating Blood Metabolites

    Get PDF
    Nutrition plays an important role in the development and progress of several health conditions, but the exact mechanism is often still unclear. Blood metabolites are likely candidates to be mediating these relationships, as their levels are strongly dependent on the frequency of consumption of several foods/drinks. Understanding the causal effect of food on metabolites is thus of extreme importance. To establish these effects, we utilized two-sample Mendelian randomization using the genetic variants associated with dietary traits as instrumental variables. The estimates of single-nucleotide polymorphisms’ effects on exposures were obtained from a recent genome-wide association study (GWAS) of 25 individual and 15 principal-component dietary traits, whereas the ones for outcomes were obtained from a GWAS of 123 blood metabolites measured by nuclear magnetic resonance spectroscopy. We identified 413 potentially causal links between food and metabolites, replicating previous findings, such as the association between increased oily fish consumption and higher DHA, and highlighting several novel associations. Most of the associations were related to very-low-density, intermediate-density (IDL), and low-density lipoproteins (LDL). For example, we found that constituents of IDL particles and large LDL particles were raised by coffee and alcohol while lowered by an overall healthier diet and fruit consumption. Our findings provide a strong base of evidence for planning future RCTs aimed at understanding the role of diet in determining blood metabolite levels

    Physician-Confirmed and Administrative Definitions of Stroke in UK Biobank Reflect the Same Underlying Genetic Trait

    Get PDF
    BACKGROUND: Stroke in UK Biobank (UKB) is ascertained via linkages to coded administrative datasets and self-report. We studied the accuracy of these codes using genetic validation. METHODS: We compiled stroke-specific and broad cerebrovascular disease (CVD) code lists (Read V2/V3, ICD-9/-10) for medical settings (hospital, death record, primary care) and self-report. Among 408,210 UKB participants, we identified all with a relevant code, creating 12 stroke definitions based on the code type and source. We performed genome-wide association studies (GWASs) for each definition, comparing summary results against the largest published stroke GWAS (MEGASTROKE), assessing genetic correlations, and replicating 32 stroke-associated loci. RESULTS: The stroke case numbers identified varied widely from 3,976 (primary care stroke-specific codes) to 19,449 (all codes, all sources). All 12 UKB stroke definitions were significantly correlated with the MEGASTROKE summary GWAS results (rg.81-1) and each other (rg.4-1). However, Bonferroni-corrected confidence intervals were wide, suggesting limited precision of some results. Six previously reported stroke-associated loci were replicated using ≥1 UKB stroke definition. CONCLUSIONS: Stroke case numbers in UKB depend on the code source and type used, with a 5-fold difference in the maximum case-sample size. All stroke definitions are significantly genetically correlated with the largest stroke GWAS to date

    More than smell - COVID-19 is associated with severe impairment of smell, taste, and chemesthesis

    Get PDF
    Recent anecdotal and scientific reports have provided evidence of a link between COVID-19 and chemosensory impairments such as anosmia. However, these reports have downplayed or failed to distinguish potential effects on taste, ignored chemesthesis, and generally lacked quantitative measurements. Here, we report the development, implementation and initial results of a multi-lingual, international questionnaire to assess self-reported quantity and quality of perception in three distinct chemosensory modalities (smell, taste, and chemesthesis) before and during COVID-19. In the first 11 days after questionnaire launch, 4039 participants (2913 women, 1118 men, 8 other, ages 19-79) reported a COVID-19 diagnosis either via laboratory tests or clinical assessment. Importantly, smell, taste and chemesthetic function were each significantly reduced compared to their status before the disease. Difference scores (maximum possible change ±100) revealed a mean reduction of smell (-79.7 ± 28.7, mean ± SD), taste (-69.0 ± 32.6), and chemesthetic (-37.3 ± 36.2) function during COVID-19. Qualitative changes in olfactory ability (parosmia and phantosmia) were relatively rare and correlated with smell loss. Importantly, perceived nasal obstruction did not account for smell loss. Furthermore, chemosensory impairments were similar between participants in the laboratory test and clinical assessment groups. These results show that COVID-19-associated chemosensory impairment is not limited to smell, but also affects taste and chemesthesis. The multimodal impact of COVID-19 and lack of perceived nasal obstruction suggest that SARS-CoV-2 infection may disrupt sensory-neural mechanisms.info:eu-repo/semantics/acceptedVersio
    • …
    corecore